B/448946 



Carboxy terminus 
of a-factor; 



Possible codons 
and their usage 

Consensus 

ol igonucleotides: 

Synthesized 
ol igonucleotide 
pools complemen- 
tary to above: 



Gly - Gin _ Pro - Met ~ Tyr COOH 
SGG (0) CCG (0) 



5'-ggJcaaccJatgtac 



I. 5 '-GTACATTGGTTG.ee 

o 

II. 5 ' -GTACATAGGTTG^CC 

O 



FIGURE 1 



K/448946 



CGACA6TAAATTTTGCCGAATTTAATAGCTTCTACT6AAAAACAGTGGACCATGTGAAAAGATGCATCTCATTTATCAA 
-280 -260 .240 -220 

ACACATAATATTCAAGTGAGCCTTACTTCAATTGTATTGAAGTGCAAGAAAACCAAAAAGCAACAACAGGTTTTGGATA 
-2&0 -1B0 -150 -170 

AGTACATATATAAGAGGGCCTTTTGTTCCCATCAAAAATGTTACTGTTCTTACGATTCATTTACGATTCAAGAATAGTT 
-170 -ltfO -10 .#0 

1 

Met Arg Phe Pro Ser He 

CAAACAAGAAGATTACAAACTATCAATTTCATACACAATATAAACGATTAAAAGA ATG AGA TTT CCT TCA ATT 
-40 -20 1 



20 



Phe Thr Ala Val Leu Phe Ala Ala Ser Ser Ala leu Ala Ala Pro Val Asn Thr Thr Thr 
TTT ACT GCA GTT TTA TTC GCA GCA TCC TCC GCA TTA GCT GCT CCA GTC AAC ACT ACA ACA 
20 40 60 



40 

Glu Asp Glu Thr Ala Gin He Pro Ala Glu Ala Val He Gly Tyr Leu Asp Leu Glu Gl y 
GAA GAT GAA ACG GCA CAA ATT CCG GCT GAA GCT GTC ATC GGT TAC TTA GAT TTA GAA GGG 
80 100 ;120 



Asp Phe Asp Val Ala Val Leu Pro Phe Ser Asn Ser Thr Asn Asn Gly Leu Leu Phe He 

GAT TTC GAT GTT GCT GTT TTG CCA TTT TCC AAC AGC ACA AAT AAC GGG TTA TTG TTT ATA 
140 160 180 

mmmm^mm 80 

Asn Thr Thr He Ala Ser He Ala Ala Lys Glu Glu Gly Val Ser Leu Asp Lys Arg Glu 

AAT ACT ACT ATT GCC AGC ATT GCT GCT AAA GAA GAA GGG GTA TCT TTG GAT AAA AGA GAG 
200 220 240 



Ala Glu Ala 
GCT GAA GCT 

260 



100 

Trp His Trp Leu Gin Leu Lys Pro Gly Gin Pro Met Tyr 
TGG CAT TGG TTG CAA CTA AAA CCT GGC CAA CCA ATG TAC 



280 



300 



Lys Arg Glu Al a 
AAG AGA GAA GCC 



Glu Ala Glu Ala 
GAA GCT GAA GCT 

320 



120 

Trp His Trp Leu Gin Leu Lys Pro Gly Gin Pro Met Tyr 
TGG CAT TGG CTG CAA CTA AAG CCT GGC CAA CCA ATG TAC 



340 



Lys Arg Glu 
AAA AGA GAA 



360 



Ala Asp Ala Glu Ala 
GCC GAC GCT GAA GCT 

380 



140 

Trp His Trp Leu Gin Leu Lys Pro Gly Gin Pro Met Tyr 
TGG CAT TGG CTG CAA CTA AAG CCT GGC CAA CCA ATG TAC 
***** 



400 



Lys Arg 
AAA AGA 



420 



Glu Ala Asp Ala Glu Ala 
GAA GCC GAC GCT GAA GCT 

440 



160 165 
Trp His Trp Leu Gin Leu Lys Pro Gly Gin Pro Met Tyr 
TGG CAT TGG TTG CAG TTA AAA CCC GGC CAA CCA ATG TAC 



End 
TAA 



460 



480 



GCCC6ACTGATAACAAWGTGTAGATGTAACAAAGTCGACTTTGTTCCCACTGTACTTTTAGCTCGTACAAAATACAAT 
5B0 570 5?0 5?0 



ATACTTTTCATTTCTCCGTAAACAAC flT GTTTTCCCATGTAATATCCTTTTCTATT 
5*50 6$0 620 



lauu LL.U 1 I MLLMML I I I AL 

640 



ACATACTTTATATAGCTATTCACTTCTATACACTAAAAAACTAAGACAATTTTAATTTTGCTGCCTGCCATATTTCAAT 
6E0 650 7150 770 

TTG7TATAAATTCCTATAATTTATCCTATTAGTAGCTAAAAAAAGATGAATGTGAATCGAATCCTAAGAGAATTC 
7*0 750 750 8U0 



FIGURE 3 



J?, 



TTCTTCATTGGTACATCAATGCCAGCAACGATGTGCGCATCTGGGCGACGCCTG7AGTGATTGTTTTCAAGGTATCGAG 
-3ff0 -250 -250 -270 

CCAAACTATTCATCGTTACTGTTTCAAATATTCAGTTGTTTCAGTACAGAGTCGCCGTGGACCTAGTGAAACTTGGTGT 
-2?0 -250 -l50 -150 

CTTTACAGCGCAGAGACGAGGGCTTATATGTATAAAAGCTGTCCTTGATTCTGGTGTAGTTTGAGGTGTCCTTCCTATA 
-140 -170 -Iffo -50 

TCTGTTTTTATATTCTATATAATGGATAATTACTACCATCACCTGCATCAMTTCCAGTAAATTCACATATTGGAGAAA 
-60 -40 .20 

1 10 20 

Met Lys Phe lie Ser Thr Phe Leu Thr Phe He Leu Ala Ala Val Ser Val Thr Ala Ser 

ATG AAA TTC ATT TCT ACC TTT CTC ACT TTT ATT TTA GCG GCC GTT TCT GTC ACT GCT AGT 

1 20 40 60 

30 40 

Ser Asp Glu Asp lie Ala Gin Val Pro Ala Glu Ala lie He- Gly Tyr Leu Asp Phe Gly 

TCC GAT GAA GAT ATC GCT CAG GTG CCA GCC GAG GCC ATT ATT* GGA TAC TTG GAT TTC GGA 
80 100 120 



50 



60 



Gly Asp His Asp lie Ala Phe Leu Pro Phe Ser Asn Ala Thr Ala Ser Gly Leu Leu Phe 
GGT GAT CAT GAC ATA GCT TTT TTA CCA TTC AGT AAC GCT ACC GCC AGT GGG CTA TTG TTT 

140 160 180 



70 



80 



lie Asn Thr Thr He Ala Glu Ala Ala Glu Lys Glu Gin Asn Thr Thr Leu Ala Lys Arg 
ATC AAC ACC ACT ATT GCT GAG GCG GCT GAA AAA GAG CAA AAC ACC ACT TTG GCG AAA AGA 
200 220 240 



Glu Ala Val Ala Asp Ala 
GAG GCT GTT GCC GAC GCT 



90 

Trp His Trp Leu Asn Leu Arg Pro Gly Gin Pro Met Tyr 
TGG CAC TGG TTA AAT TTG AGA CCA GGC CAA CCA ATG TAC 



100 
Lys 
AAG 



260 



280 



300 



Arg Glu Ala Asn Ala Asp Ala 
AGA GAG GCC AAC GCT GAT GCT 



110 



120 



Trp His Trp Leu Gin Leu Lys Pro Gly Gin Pro Met Tyr 
TGG CAC TGG TTG Ttt CTC TOE" CCA GGC CAA CCA ATG TAC 



320 



340 



360 



End 

TGA AAAATGACCCTAAACTACTTCTAAACCCTn"CGATTrCTrTCACGTTCATACAACACCTAGTTTTATTTATTTTC 
350 450 470 

TTTTCAATCTGAGTAGTTGAGTrnCGATCACTCACATAGAACTATTTTTTGCCATTTAAATAAAGTATTCTCTCAAAT 
470 450 450 SCO 

GATGCGATACTATAATACTCTnGCCATATAnACATTCATTCATAAATAGGCTATGTTTCTATATCCGTTTCCGATTC 
570 5?0 550 550 



TGTCTGCAAGCAAGGTTCCCTATCATTACCGGATTGTTCACTATGGTTGGAGCTC 
6fl0 fi70 SlO 



FIGURE k 
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4 



ALPHA-FACTOR "PRE-PRO" SEQUENCE 
...LYS ARG GLU ALA GLU ALA 
...AAA AGA GAG GCT GAA GCT 
...TTT TCT CTC CGA CTT CGA 



MODIFIED I FN- ALPHA].. GENE 
LEU ' GLU PHE MET. . . 
CTA GAA TTC ATG... 
GAT CT.T A AG , TAC . . . 



Xbal 



EcoRI 



FIGURE 7 




FIGURE 8 




FIGURE 9 



EcoR 




BomH\ 



Portial EcoRl 
digestion 

Isolate I800bp 
fragment 



aG cTC X 

A A 

C T 
G r aC 
A G G"T 

— TCTTTGGATAA A AGA TGTGATCTCCCTGAG — 
3' HO-AACCTATTTTCTACACTAGAGGGA-p 5' 




t 




DNA polymerase (Klenow] 
TA DNA ligase 
dNTPs , ATP 



Transform £". co// 



MF a\ promoter 
ond 

pre-pro" sequence 



modified 
IFN-al 
gene 



£c<?Ri 



I230bp 



'/S/S/SS/s/sX 



570bp 



A&al ffcoRI 



Ligate/to £toRI-cut MI3mp8 ds DNA 
Transform Eco// 

Determine orientation of insert in 
recombinant phage (white plaques) 

Prepare single-stranded template DNA 
from phage 

Anneal with synthetic oligonucleotide 



Screen plaques with 32 P-labeled 
mutagenic oligonucleotide 
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1 10 20 

MFal MetArgPheProSerll ePheThrAl aValLeuPheAl aAl aSerSerAI aLeuAl aAl a 

Wa2 MetLysPhelleSerThrPheLeuThrPhe-- IleLeuAlaAla 

Consensus Met — Phe — Ser — Phe LeuAlaAl a 

30 40 
MFal ProValAsnThr — ThrThrGl uAspGl uThrAl aGl nil eProAlaGluAl aVal lie 
,MFa2 ValSerValThrAlaSerSerAspGluAspIleAlaGlnValProAlaGluAlallelle 
Consensu s Thr Al aGl n — ProAl aGl uAl a — II e 

50 ; 60 
MFal GlyTyrLeuAspLeuGl uGl yAspPheAspValAI aValLeuProPheSerAsnSerThr 
MFa2 GlyTyrLeuAspPheGlyGlyAspHisAspIleAlaPheLeuProPheSerAsnAlaThr 
Consensus GlyTyrLeuAsp GlyAsp — Asp — A1a : — LeuProPheSerAsn — Thr 

70 80 
MFal AsnAsnGlyLeuLeuPhell eAsnThrThrll eAl aSerll eAl aAl aLysGl uGl uGl y 
MTo2 AlaSerGlyLeuLeuPhell eAsnThrThrll eAl aGl uAl aAl aGluLysGluGlnAsn 
Consensus GlyLeuLeuPhell eAsnThrThrll eAl a Al a — LysGlu 

90 100 

MFal Val SerLeuAspLysArgGl uAl aGl u Al aTrpHi sTrpLeuGl nLeuLysPro 

MFa2 ThrThrLeuAl aLysArgGl uAl aValAl aAspAl aTrpHi sTrpLeuAsnLeuArgPro 
Consensus Leu — LysArgGl uAl a Al aTrpHi sTrpLeu — Leu — Pro 

110 120 
MFal Gl yGl nProMetTyrLysArgGl uAl aGl uAl aGl uAl aTrpHi sTrpLeuGl nLeuLys 

MF a 2 Gl yGl nProMetTyrLysArgGl uAl aAsnAl aAspAl aTrpHi sTrpLeuGlnLeuLys 

Consensus GlyGl nProMetTyrLysArgGl uAl a — Ala — Al aTrpHi sTrpLeuGlnLeuLys 

130 140 
MFo 1 ProGl yGl nProMetTyrLysArgGl uAl aAspAl aGl uAl aTrpHi sTrpLeuGl nLeu 

MFa2 ProGlyGlnProMetTyr 

Consensus ProGlyGlnProMetTyr 

150 160 
MFal LysProGlyGlnProMetTyrLysArgGluAlaAspAl aGl uAl aTrpHi sTrpLeuGl n 
Consensus 

168 

MFal LeuLysProGlyGl nProMetTyr 
Consensus 



FIGURE 12 



